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Abstract. It was proved in the paper [KM1] that the properties of almost all points 
of M n being not very well (multiplicatively) approximable are inherited by nondegen- 
erate in M. n (read: not contained in a proper affine subspace) smooth submanifolds. In 
this paper we consider submanifolds which are contained in proper affine subspaces, 
and prove that the aforementioned Diophantine properties pass from a subspace to 
its nondegenerate submanifold. The proofs are based on a correspondence between 
multidimensional Diophantine approximation and dynamics of lattices in Euclidean 
spaces. 



1. Introduction 

We denote by M m n the space of real matrices with m rows and n columns. 
Ik G Mk,k stands for the identity matrix. Vectors are named by lowercase boldface 
letters, such as x = (xi | 1 < i < k). For x 6 I 1 we let ||x|| = maxi<i<fc \xi\. 
stands for a zero vector in any dimension, as well as a zero matrix of any size. The 
Lebesgue measure in M. k will be denoted by | ■ |. 

We start by recalling several basic facts from the theory of Diophantine approx- 
imation. For v > and m, n G N, let us denote by W v (m, n) the set of matrices 
A G M m n for which there are infinitely many qeZ" such that 

(1.1) Hq + pII < ||q|r w for some P GZ m . 

Clearly W Vl (m, n) D W V2 (m, n) if V\ <v%- We will also use the notation 

W+(m, n) = f W u (m, n) and W^m, n) = f C\ W u (m, n) . 

u>v u<v 

One knows that W n / m (m, n) = M m ^ n by Dirichlet's Theorem, and that the 
Lebesgue measure of W v (m, n) is zero whenever v > n/m due to the Borel-Cantelli 
Lemma. In particular, the set W^, m (m,n) has zero measure. Matrices from the 
latter set are called very well approximable, to be abbreviated as VWA. 

It follows from Khintchine's Transference Principle, see e.g. [C, Chapter V], 
that the statements A G W + , (m, n) and A T G W + , (n, m) are equivalent. In 
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particular, a vector y G ffi n interpreted as an n x 1 matrix is VWA iff it is VWA 
when viewed as a 1 x n matrix. Our goal in the present paper is to look at VWA 
vectors in M n , and it will be more convenient for us to use the row vector approach, 
so that, for y as above and for q G M n , yq will stand for yiqi + • ■ • + y n q n . In view 
of the aforementioned duality, this causes no loss of generality, and, hopefully, will 
cause no confusion. 

We now specialize to the case m = 1; that is, consider Diophantine properties 
of vectors (= row matrices) y G l n . Following the terminology introduced by 
V. Sprindzuk, say that a submanifold M. of M. n is extremal if almost all y G Ai 
(with respect to the natural measure class) are not VWA. In other words, if the 
property of generic y G M n being not VWA is inherited by the submanifold. Pushing 
this terminology a little further, let us say that a map f from an open subset U of 
R d to R n is extremal if f (x) is not VWA for a.e. x6f/. 

Proving extremality of smooth manifolds/maps has been one of the central topics 
of metric Diophantine approximation for the last 40 years, the major driving force 
being Sprindzuk's 1964 solution [Spl] of a long-standing problem of K. Mahler [M], 
that is, proving the extremality of the so-called rational normal or Veronese curve 



See [Sp2, Sp3, BD] for history and references. 

In his 1980 survey of the field [Sp4], Sprindzuk conjectured that a real analytic 
manifold M. is extremal whenever it is not contained in any proper affine subspace 
of W 1 . The latter condition, loosely put, says that M. 'remembers' the dimension 
of the space it is imbedded into; and the conjecture asserts that M. must also 
'remember' the law of almost all points being not VWA. 

This conjecture was proved by G. A. Margulis and the author [KM1] in a stronger 
form, with the aforementioned geometric condition replaced by an analytic one, and 
the real analytic class extended to C k for large enough k. We need the following 
definitions. Let U be an open subset of C an affine subspace of M n , and let 
f = (/i,...,/ n ) be a C k map U — > C. For / < k and x G U, say that f is 
l-nondegenerate in C at x if 



(a linear subspace £ of lR n is called the linear part of C if C = £ + y for some 
y G M n ). We will say that f is nondegenerate in C at x if (1.3) holds for some /. If 
M. is a (i-dimensional submanifold of £, we will say that M. is nondegenerate in C 
at y G M. if any (equivalently, some) diffeomorphism f between an open subset U 
of IR d and a neighborhood of y in M. is nondegenerate in C at f _1 (y). We will say 
that f : U — > C (resp. M C C) is nondegenerate in C if it is nondegenerate in C at 
almost every point of U (resp. M, in the sense of the natural measure class on M). 
One of the main results of [KM1] is the following 

Theorem 1.1. Let f : U — > W 1 , U C R. d , be a smooth map which is nondegenerate 
in R n . Then f is extremal. 



(1.2) 




(1.3) 



the linear part of C is spanned by 
partial derivatives of f at x of order up to / 
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In particular, smooth submanifolds of R n which are nondegenerate in M n are ex- 
tremal. Note that many special cases were proved before the general case; see [KM1, 
BD] for a detailed account, and [BKM, BBKM, KLW] for further developments. 

The goal of the present paper is to study manifolds for which the aforementioned 
non-degeneracy-in-M n condition fails. In fact, the simplest ones, namely proper 
affine subspaces of R n themselves, have been the subject of several papers [S2, Sp3, 
BBDD], and certain conditions have been found sufficient for their extremality. To 
the best of the author's knowledge, nobody has yet turned attention to proper 
submanifolds of affine subspaces of lR n . Let us now state one of the main results of 
the present paper, which addresses this gap. 

Theorem 1.2. Let C be an affine subspace ofW n . Then: 

(a) if C is extremal and f : U — > C, U C M. d , is a smooth map which is 
nondegenerate in C, then f is extremal; 

(b) if C is not extremal, then all points of C are VWA (in particular, no subset 
of C is extremal). 

This result generalizes Theorem 1.1, showing that the extremality of affine sub- 
spaces is inherited by their nondegenerate submanifolds. It also implies that a 
manifold nondegenerate in some affine subspace of M. n is extremal if and only if this 
subspace contains at least one not very well approximable point. Cf. a similar state- 
ment conjectured by B. Weiss in the context of interval exchange transformations 
and Teichmiiller flows [W, Conjecture 2.1]. 

The proof is based on the methods of [KM1], that is, on the correspondence 
between approximation properties of vectors and trajectories of lattices in Euclidean 
spaces. Necessary background is reviewed in §2. Then in §3 we use the language 
of lattices to give a necessary and sufficient condition for a map f : U — > M n , 
U C M. d , within a certain class of good maps to be extremal, and then show that 
this condition is inherited by nondegenerate submanifolds of affine subspaces. 

Dealing with an s-dimensional affine subspace of M n , one can be more specific 
and phrase the aforementioned condition in terms of coefficients of an affine map 
parametrizing the subspace. By permuting variables one can without loss of gen- 
erality choose a parametrizing map of the form x i— > (x,xA' + a ), where A' is a 
matrix of size s x (n — s) and a G IR n_s (here both x and a are row vectors). In 
an even more abbreviated way, we will denote the vector (1, x\, . . . , x s ) by x, and 



We show in §4 how the results of §3 allow one to write down a condition on A 
(see Theorem 4.3) equivalent to the extremality of the map (1.4). On the other 
hand, it easily follows from the definitions, as explained in §4, that every point of L 
parametrized by (1.4) is VWA whenever A belongs to W+(s + 1, n — s). We show 
that the converse is also true in the following two cases, and by the following two 
methods: 



the matrix 




by A G M s _|_i n _ s ; then C is parametrized by 



(1.4) 



x i— > 



(x, SL4) . 
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(1.5) 



s = n — 1 (that is, C is an affine hyperplane) , 



— as a consequence of Theorem 4.3, and 

s = 1 and A is of the form ( , ] for a row vector b 

(1.6) VV 

(that is, £ is a line passing through the origin) . 

- using an argument borrowed from [BBDD]. In other words, the following can be 
proved: 

Theorem 1.3. In the two special cases (1.5) and (1.6), the map (1.4) is extremal 
if and only if 

(1.7) A£ W+(s + l,n-s). 

Whether the same is true for an arbitrary affine subspace is not clear. Since 
matrices A as above provide local coordinate charts to the set of s-dimensional 
affine subspaces of lR n , and in view of M. Dodson's [Do] formula for the Hausdorff 
dimension of the sets W v (m, n), the affirmative answer to the above question would 
imply that the dimension of the (null) set of non-extremal s-dimensional affine 
subspaces of R n is equal to 

(1.8) dim(>V+(s + l,n-s)) = (n - s - l)(s + 1) + 1 , 

which is precisely 1 + the Hausdorff dimension of the set of 'rational' s-dimensional 
subspaces, i.e. of the set 

{A G M S+1:U _ S | Aq + p = for some p G Z s+1 , q G Z n ~ s \ {0}} . 

Other open problems and generalizations are discussed in the last two sections 
of the paper. This includes the so-called multiplicative modification of the standard 
set-up, which is the subject of §5. Namely, there we define not very well multiplica- 
tively approximable (not VWMA, a property stronger than 'not VWA' but still 
generic in R n ) vectors and strongly extremal manifolds (i.e. those for which almost 
all points are not VWMA). It was proved in [KM1] that smooth nondegenerate 
submanifolds of R n are strongly extremal; we generalize this as follows: 

Theorem 1.4. Let C be an affine subspace ofM. n . Then: 

(a) if C is strongly extremal and f : U — > C is a smooth map which is nonde- 
generate in C, then f is strongly extremal; 

(b) if C is not strongly extremal, then all points of C are VWMA (in particular, 
no subset of C is strongly extremal). 

Similarly to Theorem 1.2, this is done by writing down a necessary and sufficient 
condition (see Theorem 5.3) for a good map to be strongly extremal, and then 
showing that condition to be inherited by nondegenerate submanifolds of affine 
subspaces. Following the lines of §4, we are able to simplify that condition in the 
case (1.5), thus explicitly describing strongly extremal hyperplanes and identifying 
those which are extremal but not strongly extremal. Whether this can be extended 
beyond the codimension one case is an open question. 
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2. DlOPHANTINE APPROXIMATION AND LATTICES 



In this section we introduce some notation and terminology which will help us 
work with discrete subgroups T of M fc , fceN. We define the rank rk(T) of V to be 
the dimension of Mr. Also define S(T) to be the norm of a nonzero element of V 
with the smallest norm, that is, 

6(T) = f inf II v II . 

verx{o} 

For < j < k, let us denote by Skj the set of all subgroups of Z fc of rank j, 
and by Sk the set of all nonzero subgroups of Z h of rank smaller than k, that is, 

'-'A: — U j = l<->A;,j- 

It will be useful to consider exterior products of vectors generating V. Namely, 
if T G Skj, say that w G f\ J (R h ) represents V if 



w = 



1 if .7 = 

Vi A • • • A Vj if j > and v 1? . . . , Vj is a basis of V . 



Clearly the element representing Y is defined up to a sign. With some abuse of 
notation, we will also denote by Skj and Sk the set of w G A(^) representing 
T G Sk,j and G Sk respectively. 

The set of all lattices (discrete subgroups of maximal rank) in M, k of covolume 
one can be identified with the homogeneous space SLfc(IR)/ SLfc(Z), which we will 
denote by f^. It is a noncompact space with finite SLfc(lR) -invariant measure, and 
the restriction of the function 5(-) defined above to this space can be used to describe 
its geometry at infinity. Namely, Mahler's Compactness Criterion [R, Corollary 
10.9] says that a subset of Qk is relatively compact if and only if 5 is bounded away 
from zero on this subset. Further, it follows from the reduction theory for SLfc(Z), 
see e.g. [Si, Satz 4], that the ratio of 1 + log (l/5(-)) and 1 + dist(-, Z k ) is bounded 
between two positive constants for any right invariant Riemannian metric 'dist' on 
the space of lattices. In other words, a lattice A G flk for which 5(A) is small is 
approximately log (l/5(A)) away from the base point Z fc . The reader is referred to 
[Kl] for more details. 

This justifies the following definition: for 7 > and any one-parameter semi- 
group F = {g t I t > 0} acting on O^, say that the F-trajectory of A G grows 
with exponent > 7 if there exist arbitrarily large positive t such that 

S(g t A) < e-* . 

Also define the growth exponent 7f(A) of A with respect to F to be the supremum 
of all 7 for which the -F-trajectory of A grows with exponent > 7. In view of the 
preceding remark, one has 

dist(gtA,Z™) 
7 F (AJ = limsup . 

t^oo t 

Now let us describe a correspondence, dating back to [S3] and [D], between 
approximation properties of vectors y G M n and dynamics of certain trajectories in 
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n n+i . Given a row vector yeR™ one considers a lattice u y Z n+1 in IR n+1 , where 

u y d = ^ q / ) ' ^^ a ^ * S ' ^ e couect ^ on °^ vect ors of the form ^ ^ ^ > where 

p G Z and q G Z n . Then one reads Diophantine properties of y from the behavior 
of the trajectory Fu y Z n+1 , where 

(2.1) F = {g t \t>0}, with ^ = diag(e t ,e- t / n ,...,e-*/ n ), 

is a one-parameter subsemigroup of SL n +i(R) which expands the first coordinate 
and uniformly contracts the last n coordinates of vectors in R n+1 . 

The passage from Diophantine approximation to growth exponents of trajectories 
will be based on the following elementary lemma: 

Lemma 2.1. Suppose we are given a set E C M 2 which is discrete and homoge- 
neous with respect to positive integers, that is, kE C E for any k G N. Also take 
a,b > 0, v > a/b, and define 7 by 

, bv — a a + 7 

2.2 7 = u= T -. 

v ' v + 1 6-7 

T/ien £/je following are equivalent: 

[2.1-i] i/iere exist (x, z) G -E 1 witt arbitrarily large \z\ such that \x\ < \z\~ v ; 
[2.1-ii] there exist arbitrarily large t > such that for some (x,z) G E \ {0} one 

(2.3) uvd^(e at \xle- ht \z\) <e- lf . 



Proof. Assume [2.1-i], take (x,z) G E with \x\ < \z\~ v , and define t by e _6 *|z| = 
e~ 7t , that is, \z\ = e^ & ~ 7 ^. (Note that it follows from (2.2) that 7 < b.) Then one 
has 

e at \x\<e at \z\- v =e at (e {b -~ i)t )- v = e" 7 *, 

(2.2) 

that is, (2.3) holds for this choice of x, z and t. Taking \z\ arbitrarily large produces 
arbitrarily large t as well. 

Assume now that [2.1-ii] holds. Then one can find a sequence t n — > 00 and 
(x n , z n ) £ E \ {0} such that 

(2.4) e atn \x n \ < e~ lU and e" w " |z n | < e" 7 *" , 

and write 

|x n | < e - ^" 1 " 7 - 1 * 11 = e~ v( - b ~' 1 ' )tn < \z n \~ v . 

(274) (2.2) (2T4) 

If the sequence {z n } is unbounded, [2.1-i] is proved. Otherwise, note that x n — > 
due to (2.4); by the discreteness of i?, the sequence {(x n ,z n )} must stabilize, and 
thus one has (0, z) G E for some z > 0. But then (0, fcz) G -E 1 for any k G N by the 
homogeneity, and the proof of [2.1-i] is finished. □ 
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Corollary 2.2. y G W v (l,n) iff the growth exponent 7 F (u y Z n+1 ) of u y Z n+1 with 
respect to F as in (2.1) is not less than 7, the latter being defined by 

, n v - n n{l + 7) 

(2.5) !=—< 77 v = -T -• 

v ; ' n(u + l) 1-77,7 

Proof. This corollary is in fact a special case of Theorem 8.5 from [KM2]. However 
one can easily derive it from the previous lemma by taking a = 1, b = 1/n and 

£ = {(yq + P, llqll) I (P,q)e2" +1 }, 
and noticing that the inequality 

(2.6) 5ig t u y Z n+1 ) < e"T* 

amounts to the validity of (2.3) for some (x, z)e£ \ {0}. □ 

Corollary 2.3. TTie following are equivalent for y G M n and F as m (2.1): 

[2.3-i] y is VWA; 
[2.3-ii] 7 F (w y Z n+1 ) > 0; 

[2.3-iii] for some 7 > there exist infinitely many t G N swc/i £/ia£ (2.6) holds. 

Proof. The equivalence of [2.3-i] and [2.3-ii] is straightforward from Corollary 2.2 
and (2.5), while to derive [2.3-iii] one notices that the ratio of d(gt-) and 8(gt>-) is 
uniformly bounded from both sides when \t — t'\ < 1. □ 

We return now to the setting of the Diophantine approximation on subsets of 
]R n . More precisely, we consider a map f = (fi,...,f n ) '■ U — > M n , where U 
is an open subset of M d , and study Diophantine properties of vectors f(x) for 
a.e. x G U. This calls for considering the corresponding map from U into O n+ i, 
namely x 1— > -u f ( x )Z n+1 , where 

(0 7^ „ d _ef (1 f(x) 

(2.7) u f(x) - ^ ^ 

and then looking at growth of trajectories of lattices w f(x )Z n+1 under the action of 
g t as in (2.1). 

In the next section we will describe a method, introduced in [KM1], which is 
based on keeping track on what happens to every subgroup T of Z n+1 under the 
action by «f( x ) and then by g t . Fix a basis e ,ei,...,e n of IR n+1 , and for / = 
{ii, . . . C {0, . . . ,n}, i\ < 12 < • ■ • < ij, let 

ei d = f eil A-.-Ae*. G /\ J (IR n+1 ) , 

with the convention e = 1. We extend the norm || • || from R n+1 to the exterior 
algebra /\(IR n+1 ) by || X]/c{i j} w i e i\\ = max /c{o,...,n} \ w i\- Thus it makes sense 

def 

to define the norm of V as above by ||r|| = ||w||, where w represents V. Note 
that the ratio of ||r|| and the volume of the quotient space Rr/r is uniformly 
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bounded between two positive constants (depending on n and on the choice of the 
norm). Also note that it follows from Minkowski's Theorem that 5(F) must be 
small whenever ||r|| is small; more precisely, for any j > there exists a positive 
constant c(j) such that 

(2.8) 5(F) < c(j)\\n 1/j 
for any F of rank j. 

As a preparation for the next section, let us write down a formula for <7titf( x )W, 
where w represents a subgroup F of Z n+1 . Note that the action of itf( x ) leaves eo 
invariant and sends e« to + /i(x)e , i = 1, . . . , n. Therefore 

J e, if G / 

I e 7 + E i6/ (-l)^ / ' i )/,(x)e JU{0} ^ 0} otherwise, 

where one defines 

/(/, i) = f the number of elements of I strictly between and i. 
Taking w of the form ^ 7 W/ej, one gets 

Mf(x)W = K + ^2(-l) l{I,t) w IU{i} ^ {t)} fi(x))e I + ^2w I e I , 
oe/ ;£/ 00 

and, further, for w G A J (^ n+1 ), 

(2.9) y t u f ( x )W = e^ t ^(w,+^(-l) l(; ' !) w IU{ , h{0) / l (x))e / +e-» t ^w I e I . 

oe/ i£/ o^/ 
What is important here is that each of the coordinates of gtUf^w is expressed as 
a linear combination of functions 1, fi, . . . , f n . 

3. EXTREMALITY CRITERIA FOR GOOD MAPS 

Let us recall the definition introduced in [KM1]. If C and a are positive numbers 
and V a subset of R d , let us say that a function / : V — > R is (C, a)-good on V if 
for any open ball B C V and any e > 0, one has 

(3 - 1} \{x e B | \f(x)\ < e ■ sup |/(x)|}| < Ce a \B\ . 

xEB 

See [KM1, BKM] for various properties and examples of (C, a)-good functions. One 
property will be particularly useful: it is easy to see that 

(3.2) fi, i e /, are (C, a)-good on V =3- so is sup . 

iei 

Now let f = . . . , / n ) be a map from an open subset [/ of R d to R n . We 
will say that f is good at xo G £7 if there exists a neighborhood V C C7 of xo and 
positive C, a such that any linear combination of 1, fi, . . . , f n is (C, a)-good on V. 
We will say that f : U — > R n is good if the set of xo G U such that f is good at xo 
has full measure. Note that C, a do not have to be uniform in x G U ; however, 
once 7 9 x is chosen, every function of the form / = cq + Ci/i + • • • + c n f n must 
satisfy (3.1) for some uniformly chosen C and a. 

Recall (see [KM1, Lemma 3.2]) that the basic example of (C, a)-good functions 
is given by polynomials: any polynomial map f : R d — > R n is good at every point of 
R d . A more general class of examples is given by linear combinations of coordinate 
functions of nondegenerate maps: 
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Proposition 3.1 [KM1, Proposition 3.4]. Let f = (fi, ■ ■ ■ , f n ) be a smooth map 
from an open subset U of M, d to R n which is l-nondegenerate in W 1 at xq G U . 
Then there exists a neighborhood V C U of x and positive C such that any linear 
combination of 1, /i, . . . , f n is (C, l/dl)-good on V . 

In other words, f is good at every point at which it is nondegenerate in M. n . From 
this one easily derives 

Corollary 3.2. Let C be an affine subspace of W 1 and let f = (fi,---,f n ) be a 
smooth map from an open subset U of M d to C which is nondegenerate in C at 
xo G U . Then f is good at xo- 

Proof. Put dim(£) = s, choose any affine map h from M s onto £, and define 
g = (<7i, . . . , g s ) by g = h -1 o f . It follows from the nondegeneracy of f in £ that 
g is nondegenerate in IR S at xo, hence, by Proposition 3.1, it is good at xo. To 
finish the proof it suffices to observe that any linear combination of 1, fi, . . . , f n is 
a linear combination of 1, gi, . . . ,g 8 . □ 

Corollary 3.3. Let f be a real analytic map from a connected open subset U of~§L d 
to MJ 1 . Then there exists an affine subspace C of M, n such that f is nondegenerate 
in C at every point of U ; consequently, f is good at every point of U . 

Proof. For any x G U, denote by £o( x ) the linear space spanned by all partial 
derivatives of f at x, and put £(x) = f(x) + £o( x )- Then for any xo,x G U such 
that the Taylor series of f centered at x converges at x, one has f(x) G £(x ) and 
£(x) C £(x ). Since U is connected, for any x' G U one can find a finite sequence 
xi, . . . , Xfc = x' such that the Taylor series of f centered at Xj_i converges at Xj for 
alH = 1, . . . , k. Therefore £(x') = £(x ), and, by reversing the roles of x and x', 

one sees that C = £(x) is independent of x G U. It remains to notice that from 
the construction it follows that f is nondegenerate in C at every x G U, and apply 
Corollary 3.2. □ 

Note that it follows from the proof that C can be defined as the intersection of 
all the affine subspaces of R n containing f(U), or, equivalently, as 



for any x G U. 

Example 3.4. It is instructive for better understanding of the class of good maps 
to remark that the assumption of the analyticity of f cannot be dropped. Indeed, 
let us sketch a construction of a C°° function from [0, 1] to M + which is not good 
on a subset of [0, 1] of positive measure. First for every J = (a, b) C [0, 1] define 



One can easily verify that for every neighborhood V of either a or b it is impossible 
to find C, a > such that ipj is (C, a)-good on V. Then consider a Cantor set 
K C [0, 1] of positive measure, and for k G N let Jk be the collection of disjoint 
subsegments of [0, 1] thrown away at the /cth stage of the construction of K. (For 



f (x ) + Span {f (x) - f (x ) | x G U} 



ipj{x) = <p(x — a)<f(b — x) , where <p(x) 
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example, one can divide every interval left at the kth stage onto 3 fc+1 equal pieces 
and then throw away the middle interval.) After that define 



oo 



k=i jej k 



where Ck decays fast enough as k — > oo to guarantee that ip is C°° (in the afore- 
mentioned example, one can take Cf~ = 3 -3 ). Since every neighborhood of every 
point of K contains an endpoint of J G Jt for some k G N, it follows that ip is not 
good at any x G K. 

We now state an estimate from [KM1], which will be used to derive a criterion 
for the extremality of f once the latter is chosen within the class of good maps. It 
will be convenient to use the following notation: if B = S(x, r) is a ball in R d and 
c > 0, we will denote by cB the ball S(x, cr). 

Theorem 3.5 (cf. [KM1, Theorem 5.2]). For any d,k G N there exists a positive 
constant C (explicitly estimated in \KM1]) such that the following holds. Given 
C, a > 0, < p < 1/k, a ball B C R d and a continuous map h : 3 k B — > GL fc (M), 
let us assume that 

[3.5-i] for any V G Sk, the function x i— > ||/i(x)r|| is (C, a)-good on 3 k B, and 
[3.5-ii] for any V G Sk, sup x6B ||^(x)r|| > p. 

Then for any positive e < p one has 



Informally speaking, the conclusion of the above theorem says that the 'orbit' 
{/i(x)Z fc | x G B} C 0^ 'does not diverge', that is, its very significant propor- 
tion (computed in terms of Lebesgue measure on B) stays inside compact sets 
{A G O fc | 5(A) > e}. We remark that such nondivergence results have a long his- 
tory, dating back to the work of Margulis [Ma] in the 1970s, and many applications 
in the theory of dynamics on homogeneous spaces, see e.g. [KSS, Chapter 3] for a 
historical account. 

The next lemma sharpens [KM1, Theorem 5.4], giving a condition sufficient for 
the extremality of a good map f . 

Lemma 3.6. Let B be a ball in M, d , and let f = (f±, . . . , f n ) be a continuous map 
from 3 n+1 B to R n . Suppose that: 

[3.6-i] 3 C, a > such that any linear combination of 1, f\, . . . , f n is (C, a) -good 
on 3 n+1 B; 

[3.6-ii] for any (3 > there exists T = T{j3) > such that for any t > T and any 
T G S n+ i one has 




sup ||sr t M f(x) r|| > e 



-pt 



where itf( x ) is as in (2.7) and F = {g t } is as in (2.1). 
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Then f(x) is not VWA for a.e. x G B. 

Proof. We apply Theorem 3.5 with k = n + 1 and /i(x) = gtUf^y Our goal is 
to show that for any 7 > 0, the set {x G B \ ^ F (wf( x )Z n+1 ) > 7} has measure 
zero. As was observed in the preceding section, see (2.9), for every w G 5 n +i all 
the coordinates of /i(x)w are linear combinations of 1, fi, . . . , / n , which, by (3.2), 
implies [3.5-i]. Now choose any j3 < 7, take t > max ((T((3), log(n + l)//9) and put 
p = e _/3t . This guarantees p < 1/k and verifies condition [3.5-ii]. Thus, for t as 
above, the measure of each of the sets 

(3.3) {xeB|% t w f(x) Z™ +1 )<e-^} 

is not greater than CC'e""' 7- ^'. One then applies the Borel-Cantelli Lemma to 
conclude that almost every x G B belongs to at most finitely many of the sets (3.3) 
with t G N, which completes the proof in view of Corollary 2.3. □ 

The next lemma shows that assumption [3.6-ii] is in fact necessary for the ex- 
tremality of f; furthermore, the consequences of [3.6-ii] being not true are much 
stronger than positive measure of the set {x G B | f(x) is VWA}. 

Lemma 3.7. Let B be a ball in M, d , and let f be a map from B to MJ 1 such that 
[3.6-ii] does not hold. Then f (x) is VWA for all x E B. 

Proof. The assumption says that there exists f3 > such that one has 

sup ||0 t Uf( x )r|| < e' 131 

for arbitrarily large t (and V e S n+ i dependent on t). In view of (2.8), for any 
xeB this implies 

5(g t u nx) Z n+1 ) < c(j)\\g t u fix) T\\^ < c{j)e~^\ 

where j is the rank of T. Hence 7f (wf( x )Z n+1 ) > /3/n, and an application of 
Corollary 2.3 finishes the proof. □ 

We now combine the two lemmas above to obtain the desired extremality crite- 
rion. 

Theorem 3.8. Let U be an open subset JBL d , and let f be a map from U to M n 
which is continuous and good. Then the following are equivalent: 

[3.8-i] the set {x e U \ f(x) is not VWA} is dense in U; 
[3.8-ii] f is extremal (that is, the above set has full measure); 

[3.8-iii] for a.e. x G U and any r > there exists a ball B C U centered at x of 

radius less than r satisfying [3.6-ii]; 
[3.8-iv] any ball B C U satisfies [3.6-ii] . 

Proof. Obviously [3.8-ii] [3.8-i] and [3.8-iv] =>• [3.8-iii]. The implication 
[3.8-i] =>• [3.8-iv] is immediate from Lemma 3.7. Assuming [3.8-iii] and using the 
fact that f is good, for a.e. x G U one finds a ball B centered at x such that 
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the dilated ball 3 n+1 B is contained in U, and both [3.6-i] and [3.6-ii] hold. Thus 
Lemma 3.6 applies, and [3.8-ii] follows. □ 

We remark that for the equivalence of [3.8-i] and [3.8-ii] it is essential that f is 
chosen within the class of good maps. Indeed, one can consider the map f(x) = 
(x, ifj(x)\ with ij; from Example 3.4. Clearly f (x) is VWA for any x from K, which 
is assumed to have positive measure. On the other hand, the restriction of f to any 
J G Jk, k G N, is nondegenerate in M 2 , hence the set {x G (0, 1) | f (x) is not VWA} 
has full measure in (0, 1) \ K, and the latter is dense in (0, 1). 

Our next task is to rephrase [3.6-ii]. For any B C U let us denote by Tb the 
IR-linear span of the restrictions of 1, /i, . . . , f n to B. Then, for any ball B C U, let 
us assume that functions gi, . . . , g s : B — > R are chosen so that 1, g±, . . . , g s form a 
basis of Tb (here the dimension s + 1 of Tb may depend on B). Further, the choice 
of functions 1, g±, . . . , g s defines a matrix P G M s _|_i )n _|_i formed by coefficients in 
the expansion of 1, /i, . . . , f n as linear combinations of 1, gi, ... , g s . In other words, 

with the notation f = f (1, /i, . . . , / n ) and g = f (1, g±, . . . , ,g s ), one has 

(3.4) f(x)=g(x)P VxGB. 

This way, the restriction of f to B is described by two pieces of data: the (s + 1)- 
tuple g and the matrix P. We now proceed to show that, assuming the map f is 
good (which is an assumption involving g), a criterion for its extremality can be 
written in terms of P. 

Indeed, any / G Tb can be written as /(x) = g(x)v for some v G M n+1 , and 
because of the linear independence of components of g over R, the 'supremum- 
on-S' norm of /, that is, / i— > sup x6B |/(x)|, is equivalent to ||v||, the constant 
in the equivalence depending on B and the choice of g. Now recall that for any 
w = J2i w i e n the Jth component of ttf( x )W is equal to w x if ^ /, and to 

Wl + ^(-l) , ( / ^ IU{ , W0} / j (x) 

i<£I 

if / contains 0. It will be convenient to simplify the latter expression by introducing 
the following notation: given / C {0, ...,n} containing with |/| = j, and an 
element w = J2i W/ e i of f\ J (M n+1 ), let us define a vector c / w G R n+1 by 

(3.5) c JiW d = (- 1 ) i(7 ' i) « ; iu {i} ^ { o } ei = «; I e + ^(-l) i(/ ' i) w IuO} ^ {0} e l . 

i^(/x{0}) i£I 

Then the nonconstant components of Wf( x ) w can be written as f (x)c 7 w = g(x)Pc 7 w : 
therefore the 'supremum-on-S' norm of each of the functions can be replaced by 
H-Pcj iW ||. Modifying all the norms and using (2.9), one replaces sup x€B ||fi r tWf( x ) w || 

by max (e + ™ "* maxoe/ ||Pc 7jW ||, e""'maxo^ \ w i\)- We summarize the above dis- 
cussion as follows: 
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Proposition 3.9. Let B be a ball in R d , f a map from B to R n , {1, g±, . . . , g s } a 

basis of Tb, o,nd P a matrix satisfying (3.4). Then [3.6-ii] is equivalent to 

V/3>0 3T>0 such that Vt>T, Vj = l,...,n andVw eS n+hj 



(3.6) 



one has max (e + ™ 3 1 max ||Pc 7 w ||, e ^maxli^l) > e ^ 



In other words, we have shown that the extremality of a continuous good map 
f : U — > R n is equivalent to the validity of certain Diophantine conditions involving 
matrices P 'coordinatizing' f \b- These conditions will be made more precise in the 
next section, and Theorem 1.2 will be obtained as a corollary. 

4. Extremality criteria for affine subspaces 

Note that in general, as was mentioned above, it may not be possible to choose 
the same matrix P uniformly for all balls B in U. Let us now consider an important 
special case when this is possible: 

Theorem 4.1. Let (7cK d be an open subset, and let g = (<7i, . . . , g s ) : U — > R s 

be a continuous good map such that 

Vx e U (equivalently, Vx from a dense subset ofU) 
^ ^ the germs of 1, g\, . . . , g s at x are linearly independent over R. 

Also fix P G M s +i jn +i, and let f be given by (3.4). Then (3.6) is equivalent to each 
of the following conditions: 

[4.1-i] the set {x e £7 | f(x) is not VWA} is non-empty; 
[4.1-ii] f is extremal (that is, the above set has full measure). 

Proof. It is clear from (3.4) that f is also continuous and good. [4.1-ii] follows from 
(3.6) in view of Proposition 3.9 and the implication [3.8-iii]=^[3.8-ii]. On the other 
hand, if (3.6) is violated, Lemmas 3.7 and Proposition 3.9 imply that every point 
of U has a neighborhood B such that f(x) is VWA for all x e B, contradicting 
[4.1-i]. □ 

Informally speaking, the assumption of Theorem 4.1 says that f is not 'assembled 
from several pieces unrelated to each other'. Without that assumption, one can 
easily construct examples of good continuous (and even C°°) maps f satisfying 
[4.1-i] but not [4.1-ii]. 

Now suppose that f is real analytic and U is connected, or, more generally, that 
f is nondegenerate in some affine subspace C of R n . Then one can easily find s < n 
and a good s-tuple g satisfying (4.1). Specifically, one takes s = dim(£) and, as in 
the proof of Corollary 3.2, defines g to be equal to h _1 o f where h is any affine 
map from R s onto C. Furthermore, one easily recovers the corresponding matrix P 
by writing h in the form 

(4.2) h(x)=xP, 

where as usually we have h = f (1, h\, . . . , h n ). It follows that for fixed £, any f 
which is nondegenerate in C will satisfy the assumptions of Theorem 4.1 with some 
uniformly chosen P. In particular, Theorem 4.1 applies to the map f = h given by 
(4.2), that is, to the subspace C itself. Thus we have proved: 
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Theorem 4.2. Let C be an s- dimensional affine subspace ofW 1 parametrized as in 
(4.2) with P G M s+ i jn+ i. Then each of the following conditions below is equivalent 
to (3.6): 

[4.2-i] C contains at least one not very well approximable point; 
[4.2-ii] C is extremal; 

[4.2-iii] any smooth submanifold of C which is nondegenerate in C is extremal. 

One then recovers Theorem 1.2 as the implications [4.2-ii] =^ [4.2-iii] and 
[4.2-i] =>■ [4.2-ii] above. Note also that Theorem 1.1 is obtained by taking s = n 
and P = I n +i- 

Now recall that, as described in the introduction, one can by permuting variables 
without loss of generality parametrize C by (1.4) for some A G M s+1)n _ s ; that is, 
take P of the form P = (I s +i A). In order to restate Theorem 4.2 in terms of 
A, let us denote by c+ w (resp. c~ w ) the column vector consisting of the first s + 1 



1 ,w 
J.w 



(resp. the last n — s) coordinates of c 7 w . In other words, we have c jw = 
where 

<w= Yl (-l) l{I ' l) w IU{l} ^ {()} e l = w I e + (-I^'^/ummo^ 

ie{o}u({i,...,«}xJ) ie{i,..., s }x/ 

and 



J.w 



i€{s+l,...,n}\7 



Then one has: 



Theorem 4.3. Let C be an s-dimensional affine subspace of W 1 parametrized as 
in (1.4) with A G M s+ i in _ s . Then the following are equivalent: 

[4.3-i] C is extremal [4.2-i] and [4.2-iii] hold); 

[4.3-ii] for any (3 > there exists T > such that for any t > T, j = 1, . . . , n and 
w G S n +i,j one has 

, n + l—j f 1 1 I . _ ,, —J-f I |\ ^ — Bf 

max e ™ max cT +Ac r , e ™ max to, >e , 

[4.3-iii] Vj = l,...,n and Vt> > n+ j~ J 3iV > swc/i that for any w G «S , n +i,.j 
with max ^/ > AT, one /ias 

max ||c+ + Ac7 w II > (max \w I \)~ v . 
oei ' og/ 



Proof. The preceding argument shows the equivalence [4.3-i] [4.3-ii], while the 
fact that [4.3-ii] and [4.3-iii] are equivalent is a special case of Lemma 2.1, where 
for each j = 1, . . . , n one considers 

E= {(mjK||c+ w + Ac- w ||,max|«; 7 |) | w G S n+1 j} . □ 
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It is instructive to write down a special case of the above inequality corresponding 
to j = 1. That is, let us take v G Z n+1 \ {0} = <S n +i 5 i in place of w; one sees that 
the only one-element subset I of {0, . . . , n} for which c 7 v is defined is I = {0}, and 

it easily follows from (3.5) that c {0}>v = v. Writing v = (^^J > where p G Z s+1 and 

q G Z n_s , one gets c| 0} v = p and c^ 0}v = q. Then denoting by p' the vector with 
components pi, . . . ,p s , one writes the j = 1 case of [4.3-iii] as follows: 

[4.3-iii]j=i for any v > n there exist at most finitely many q G Z n_s such that for 
some p G Z s+1 one has 



(4.3) 



|p + ^4q|| < 



Now observe that one can safely replace the latter inequality by 
(4-4) l|p + ^q|| < NIP , 

perhaps slightly changing v. Indeed, (4.4) clearly follows from (4.3). On the other 
hand, (4.4) implies that ||p|| < C||q|| for some C dependent only on A; thus, for 
a slightly smaller v and large enough ||q||, (4.3) would follow. We arrive to the 
conclusion that [4.3-iii] J= i is equivalent to (1.7). 

However, let us point out that one does not need the full strength of Theorem 
4.3 to see that (1.7) is one of the conditions necessary for the extremality of L as 
in (1.4). Indeed, as shown above, the assumption A G W+(s + 1, n — s) amounts to 
the existence of v > n such that for infinitely many q G Z n_s one can find p G Z s+1 
satisfying (4.3). Then one can take any x G IR S and write 

\p + (x,5L4) ^ ^ | = |;po + xp' + xAq| = |x(Aq + p)| < (s + 1) ||x|| ||Aq + p||. 
Slightly decreasing v if needed, one gets infinitely many solutions of 



p„ + (x,xA)^J < 

that is, (x, xA) is proved to be VWA for all x. 

Let us now ask the following question: could it be the case that the remaining 
n — 1 conditions of Theorem 4-3 are redundant, that is, follow from [4.3-iii] j = i? The 
affirmative answer to this question would provide a very easy to state extremality 
criterion, i.e. the validity of (1.7), for affine subspaces and their submanifolds. 

The answer to this question is (in general) not known to the author. However, 
the next result shows that the case j = n of [4.3-iii] is indeed redundant. 

Lemma 4.5. For any s = 1, . . . , n — 1, any A G M s+ i )n _ s and any w G iS n -|_i jTJ , 
one has 




(4.5) 



Proof. Denote by Ji the set {0, . . . , n} \ {i}. Then any w e 5 n +i >n can be written 
in the form w = Yl7=o W i e -Ji^ an d from (3.5) it follows that for i = 1, . . . , n one has 

(4.6) Cj. tm = w i eo + (-iy~ 1 woe i . 

Therefore for any i = 1, . . . , s one has cj = 0, and hence 

c X,w + 4cj. >w = c+ w = Wl e + (-iy- 1 w e l . 

Consequently, (4.5) is satisfied whenever w ^ 0. On the other hand, w = 0, in 
view of (4.6), implies that c~ w = for any i. Taking i > s for which Wi ^ 0, one 
S ets ll c X,w + 4c- w || = \wi\. □ 

This, in particular, gives an affirmative answer to the above question in the case 
n = 2: a line in R 2 given by y = ciq+ciix is extremal if and only if ( a ° ) ^ W2 (2, 1). 



ai ^ 

It turns out that an argument similar to the proof of Lemma 4.5 produces an 
analogous extremality criterion for (n — 1) -dimensional affine subspaces of W 1 for 
arbitrary n: 

Lemma 4.6. Let A e M Uj i 6e ^zven 6y a column vector a e ffi n (this corresponds 
to s = n — 1). Then (4.5) holds for any w e «S n +i jJ - wt/i j > 1. 

Proof. Our choice of s implies that for any w e «S n +i ji? - and any / 3 of size j, the 
vector C7 w consists of a single component, namely 

/ w i otherwise. 
Therefore for 0, n G / one can write 



7 ,w 



+ ac /,w = c tw = «^ e o + X] (-l)' (/ ' l) W/u{ l} ^{o}ei. 

iG{l,...,n-l}\7 



Consequently, (4.5) is satisfied whenever w ; 7^ for some I containing n. (Here we 
use the fact that j > 1: indeed, such an / must also contain some i = 0, . . . , n — 1, 
and thus the ith coordinate of c| u{0} ^ {i} w is equal to w 7 .) On the other hand, the 
assumption w, = for all J 9 n, in view of (4.7), implies that cj = for any /; 
thus (4.5) is satisfied again. Hence one can take an arbitrary / for which u) ; 7^ and 
observe that for any t 6 I, the absolute value of the ith coordinate of c+ u{0} ^ {i} w 
is equal to {w^. □ 

Combining the above lemma with Theorem 4.3 and the equivalence of [4.3-iii] J= i 
and (1.7), one easily obtains Theorem 1.3 under assumption (1.5). In particular, 
in view of (1.8), one sees that the set of a e W 1 for which the (n — 1) -dimensional 
affine subspace of M n parametrized by 

(4.8) x = (xi, . . . , x n -i) ^ (x, xa) = (xi, . . . , x n _i, a + a\x\ H h a n _ix n _i) 

is not extremal, has Hausdorff dimension 1. 
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As was mentioned above, we are unable to show the equivalence of (1.7) and 
[4.3-iii] in the general case. However, let us now turn to assumption (1.6), under 
which that equivalence can be demonstrated by a direct proof (that is, without a 
reference to lattices). 

Until the end of this section, let us assume s = 1, take A of the form ( ?) for a 



x b 7 

row vector b = (bi, . . . , 6 n _i) G M n_1 , and let C be parametrized by (1.4); that is, 
£ is a line passing through the origin given by 

(4.9) x i— > (x, bix, . . . , b n -ix) . 

It is clear that A G W+(2, n — 1) if and only if b G W+(l, n — 1). Therefore in order 
to prove Theorem 1.3 assuming (1.6), it suffices to prove the following 

Proposition 4.7. A line given by (4.9) is extremal whenever 

(4.10) b£ W+(l,n-l). 

Proof. We follow an argument from the paper [BBDD], where a stronger (than the 
extremality) property was proved for £ as in (4.9) under the stronger (than (4.10)) 
assumption b ^ W~(l, n — 1). 

The goal is to prove that for any v > n, the set 



(4.11) <^ x e 



\p + q x + qihx H h q n -ib n -ix\ < ||q|| v 

for infinitely many q = (q , qi, . . . , q n -i) T G Z n , p G Z 



has measure zero. Clearly without loss of generality one can restrict x to lie in the 
unit interval. Also, our usual notation b = (1, b) will be helpful, since the left hand 
side of the inequality in (4.11) will be then written as \p + (bq)x|. 

Let us now state a lemma from which the desired result will easily follow. For 
b G and Q,v > 0, define the set A(h,v,Q) to be the set of x G [0,1) for 

which the inequality 

(4.12) \p+(bq)x\ < Q~ v 

holds for some p G Z, q G Z n with Q < ||q|| < 2Q. 

Lemma 4.8. For any h satisfying (4.10) and any v > n there exists a positive 
constant C = C(b, v) such that for any Q > 1 one has 

\A(b,v,Q)\ < CQ^ . 



It is easy to see that the intersection of the set (4.11) with [0, 1) is contained in 
(4.13) {x | x G A(b, v, 2 k ) for infinitely many k G N} . 

Assuming Lemma 4.8, one has |^4(b, v, 2 k ) \ < C2~ 2L ^ llk M k, and the fact that the 
set (4.13) has measure zero is then immediate from the Borel-Cantelli Lemma. □ 
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It remains to write down the 



Proof of Lemma 4-8. Define -4°(b, v, 2 k ) to be the set of x G [0, 1) for which (4.12) 
holds for some q G Z n with Q < ||q|| < 2Q and with p = 0. It is contained in a 

union of intervals of the form [0, ^tj]- Due to (4.10), there exists c = c(v) > 

such that the denominator of the above fraction is not less than 

c-max(gi,...,g„_i) 2 >c-||q|| 2 < c • (2Q) 2 . 

Therefore one has 

|^°(b,^,Q)| < c- 1 Q- v (2Q)^ = c-^Q^ . 

Now let us estimate the measure of A(b, v, Q) \A°(b, v, Q). Note that, assuming 
p ^ 0, inequality (4.12) can be solvable in x G [0, 1) only if |bq| > 1 — Q~ v > 
1 — 1/(5 > 1/2- For fixed p and q, (4.12) defines an interval of length at most 2^^-, 
and, for fixed q, the number of different centers of those intervals, that is, points 
is at most 1 + Ibql. Therefore one can write 

|bq| 

\A(b,v,Q)^A°(b,v,Q)\ < ^(l + |bq|)= 2Q-* £ (i + J-) 

||q||<2Q, |bq| ||q||<2Q, V ^ ' 

|bq|>l/2 |bq|>l/2 

< 2Q- v (AQ) n + 2Q~ v J2 7^-7- 

l|q||<2Q, ' bq l 
|bq|>l/2 

To estimate the sum in the right hand side of the above formula, note that for 
fixed qi, ... , q n -\ and variable qo, the values of bq form an arithmetic progression. 
Thus, fixing qi, . . . , q n -i, one gets 

1 / 1 1 1 



lko||<2Q, 
|bq|>l/2 



|bq| ~ VV 2 1/2 + 1 1/2 + 2Q-1 

= 4 ( 1 + 1 + ---+4^t) <4(l + log(4Q-l)). 

Summing the above estimate over all q±, ... , q n -i, one obtains 
|^(b, v, Q) \ -4°(b, v,Q)\ < 2 2n+1 Q n ~ v + SQ-^AQ^- 1 (l + log(4Q - 1)) , 



which is not greater than the right hand side of the desired inequality for an ap- 
propriate value of C . □ 
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5. Multiplicative approximation 

The dynamical approach to Diophantine problems described above has an ad- 
vantage of being quite general to allow various modifications of the set-up. In 
particular, most of the ideas described in this paper work for the so-called multi- 
plicative approximation. Let us briefly list all the relevant definitions. For x G M n 
we let 

n 

II + (x) = J J \xi\ + , where = max(|x|, 1) . 

i=i 

For v > let us denote by WM. v (l,n) the set of row vectors y G lR n for which 
there are infinitely many q G Z n such that 

(5.1) |yq + p| < n + (q)" u/n for some pEZ. 

Since the right hand side of (5.1) is not less than that of (1.1), one clearly has 
WM v (l,n) D W v (l,n); in particular, WM n (l,n) = R n by Dirichlet's Theorem. 
Also it can be shown using the Borel-Cantelli Lemma that the Lebesgue measure 
of WAi v (l,n) is zero whenever v > n. Therefore, with the definition of very 
well multiplicatively approximable (VWMA) vectors as those y G M. n which are in 
WM v (l 7 n) for some v > n, one has that almost all y 6l" are not VWMA. 

Let us now say, following the terminology of [Sp4] , that a submanifold M. of M. n 
(resp. a smooth map f from an open subset U of R d to R n ) is strongly extremal if 
almost all y G Ai (resp. f(x) for a.e. x G U) are not VWMA. It is clear that strong 
extremality implies extremality, and to prove a manifold to be strongly extremal 
is usually a harder task than just to prove extremality. For example, the strong 
extremality of the curve (1.2) (that is, the multiplicative analogue of Mahler's 
problem) was conjectured by A. Baker in 1975 [B], and the only proof that exists 
as of now is based on the dynamical approach of [KM1]. In fact, the main result 
of the latter paper ([KM1, Theorem A], of which Theorem 1.1 is a special case) is 
the strong extremality of manifolds nondegenerate in W 1 (in the analytic case this 
was conjectured in [Sp4]). 

With the help of the approach developed in [KM1], let us now try to investigate 
multiplicative approximation properties of generic points of proper affine subspaces 
and their submanifolds by first describing the set of VWMA vectors in a dynamical 
language. It turns out that the actions that are relevant for this case are multi- 
parameter. Namely, one replaces (2.1) by 

gt =diag(e t ,e- tl ,...,e- t "), 

n 

where t = (t\ , . . . , t n ) , t j > 0, and t = t j . 

i=\ 

The latter notation is used throughout the section, so that whenever t and t appear 
in the same context, t stands for Y17=i 

For the rest of this section, we mostly sketch our argument, as it is very similar 
to what is done in §§2-4, and only highlight important modifications. The following 
is a multi-parameter version of Lemma 2.1: 
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(5.2) 



Lemma 5.1. Suppose we are given a set E of pairs (x,z) e M n+1 , which is dis- 
crete and homogeneous with respect to positive integers. Then the following are 
equivalent: 

[5.1-i] for any v > n there exist (x, z) e E with z arbitrarily far from such that 

(5.3) \x\ <Tl + (z)- v / n ; 

[5.2-ii] for any 7 > there exists an unbounded set of t e /or which one has 

(5.4) e^xl^e -7 * and e~ u \z t \ < e~ 7 *, i = 1, ...,n. 

/or some (x, z) e -E 1 \ {0}. 

Proof. We follow the lines of the proof of Lemma 2.1. Assuming [5.1-i], take v > n 
and (x,z) G -E 1 satisfying (5.3), and define t by 

(5.5a) e^ 1 " 7 ^ = II + (z), 

where 7 < 1/n is as in (2.5). Then for every i define ti by 

(5.5b) e u = e^\ Zl \ + . 

Note that, since \zi\ < \zi\ + , this implies e~ li \zi\ < e -7 *; and note also that multi- 
plying all the equalities (5.5b) and comparing the result with (5.5a) one can verify 
that t = Y^i=i ^i- Then one has 

e*|x| <e t UJz)- v / n = e *(e (1 - n7)t )-" /n = e" 7 * , 

(2.5) 

that is, (5.4) is satisfied for this choice of x, z and t; taking z with arbitrarily large 
Il + (z) produces arbitrarily large values of t. 

Assume now that [5.1-ii] holds, and take 7 < 1/n. Then one can find an un- 
bounded sequence of vectors t and a sequence of points (x, z) e E \ {0} satisfying 
(5.4). Since for any t one has t{ > for at least one i, passing to a subsequence 
and reshuffling the coordinates of t and z if necessary, one can assume that for 
some k = 1, . . . , n and all entries t of that sequence, one has 

(5.6) ti > 7t for % < k, and ti < -ft for i > k . 

It follows from (5.6) and (5.4) that \zi\ + < e* i_7 * for i < k, and \zi\ < 1 for i > k, 
hence 



(5.7) n + (z) < Yl \ Zi \+ < e tl+ -+ tfe - fc7t < e (1 " 



-k-y)t 



i=l 



Now it is time to find an appropriate v. However, because of an extra parameter 
k, we have to modify (2.5), namely define v > n by 

v — n nil + 7) 

5.8 7=- ^ ^=-r — T^- 

kv + n 1 — k'f 
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Then the right hand side of (5.7) is equal to e"^ 1+7 ^, and (5.4) implies 



\x\ 



< e~^ 1+7 ^ = ( e ( 1 ~ k 'y) t ')— v / n < n + (z)~"/ n 

(5~7) 



which is exactly what was needed. After that, as in the proof of Lemma 2.1, 
one notices that a uniform bound on ||z||, by the discreteness of E, implies that 
(0, zq) G E for some zq, and integral multiples of (0, zq) give infinitely many (x, z) G 
E satisfying (5.3). To finish the proof, it remains to observe that (5.8) forces v to 
tend to n uniformly in k as 7 — > 0. □ 

Corollary 5.2. For y G M n and g t as in (5.2), i/ie following are equivalent: 
[5.2-i] y is VWMA; 

[5.2-ii] for some 7 > there exists an unbounded set of t G such that 
(5.9) 5(^ y Z" +1 ) <e" 7t ; 

[5.2-iii] for some 7 > i/iere exist infinitely many t G Z™ snc/i £/ia£ (5.9) holds. 

Note that Corollary 2.2 in [KM1] provides the implication [5.2-i] =>- [5.2-iii]. 

Proof. Taking E 1 = -u y Z n+1 , one sees that (5.9) amounts to the validity of (5.4) for 
some (x,z) G E \ {0}. The rest of the argument mimics the proof of Corollary 
2.3. □ 

From the above corollary and Theorem 3.5 it is not hard to derive multiplicative 
analogues of extremality criteria of §3 and §4. The crucial condition to consider is 
an analogue of [3.6-ii]: if B C R d is a ball and f a map from B to M n , it is important 
to check whether or not 

V/3 > 3T = T{(3) > such that for any tel" with t > T 

(^•10) an( j an y p ^ <S n +i, one has sup ||<7tWf( x )r|| > e~^ . 

x6-B 

The following can be proved by a straightforward repetition of the argument of §3: 

• (cf. Lemma 3.7) if (5.10) does not hold, then f(x) is VWMA for all x G B; 

• (cf. Lemma 3.6) if f is continuous, defined on 3 n+1 B and satisfies [3.6-i] and 
(5.10), then f(x) is not VWMA for a.e. x G B. 

Therefore one has 

Theorem 5.3. For U and f as in Theorem 3.8, the following are equivalent: 

[5.3-i] the set {x G U \ f(x) is not VWMA} is dense in U; 

[5.3-ii] f is strongly extremal (that is, the above set has full measure); 
[5.3-iii] for a.e. x G U and any r > there exists a ball B C U centered at x of 

radius less than r satisfying (5.10); 
[5.3-iv] any ball B C U satisfies (5.10). 

In order to express (5.10) in Diophantine language, we need some more notation: 
for t G and / C {0, . . . , n}, let 

t/= £ U. 

ie/\{o} 
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Then, taking f , B, {1, gi, . . . , g s } and P as in Proposition 3.9, one can observe that 
(5.10) can be written in the form 

V/3 > 3T > such that Vt G R+ with t >T and Vw6 
^■"''"''^ one has max fe t_t/ max llPc, w ||, e - * 7 max \wA) > e~^ 1 . 

Thus Theorem 5.3 implies 

Theorem 5.4. Let U , f, g and P be as in Theorem 4-1- Then (5.11) is equivalent 
to each of the following conditions: 

[5.4-i] the set {x G U | f(x) is not VWMA} is non-empty; 
[5.4-ii] f is strongly extremal (that is, the above set has full measure). 

Taking P of the form P = ( I s+1 A), one deduces 

Theorem 5.5. The following are equivalent for an s-dimensional affine subspace 
C o/R n parametrized as in (1.4) with A G M s+ i jTJ ,_ s : 

[5.5-i] C contains at least one not VWMA point; 
[5.5-ii] C is strongly extremal; 

[5.5-iii] any smooth submanifold of C which is nondegenerate in C is strongly ex- 
tremal; 

[5.5-iv] for any (3 > there exists T > such that for any t e wii/i t >T , and 
any w G iS n +i one /ms 

(5.12) max (rnaxe* - * 7 llct + Ac7 II, maxe _t/ IwJ) > e _/3 * . 

v oei" ' ' o^i ' 

This, in particular, proves Theorem 1.4, as well as gives a criterion for the strong 
extremality of C written in terms of Diophantine properties of the parametrizing 
matrix A. 

In general, it seems to be a hard problem to simplify condition [5.5-iv]. However, 
in view of computations made in §4, this can be easily done in the case s = n — 1, 
that is, when £ is a codimension one subspace. Namely, due to Lemma 4.6, one 
knows that (5.12) always holds when s = n — 1 and w e <S n +i j with j > 1; thus it 

suffice, to ha.dk the CaS e J = 1. A S in § 4, one then nses v = (j) e Z«« x {0} 

in place of w (here p G Z n and g 6 Z) and notices that c J v is defined only for 
I = {0}, with c {0} v = v, C{" 0} v = p and c^" 0} v = q. Replacing A by a column 
vector a = (ao, ai, • • • , a n _i) T G and letting p = (p ,Pi, • • • ,Pn-i) T and 

p' = (p 1 ,...,p n - 1 ) T , one gets 

Corollary 5.6. The following are equivalent for an (n — 1)- dimensional affine 
subspace C ofM, n parametrized by (4.8): 

[5.6-i] C is strongly extremal; 

[5.6-ii] for any (3 > there exists T > swc/i t/iat /or any t G R" wit/i t > T and 
any (^j G Z n+1 \ {0} one /ias 

max (e* ||p + aq\\, e~ fl |pi|, . . . , e"*™" 1 |p n _i |, e _t ™ > e _/3t . 
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[5.6-iii] for any v > n there exists K > such that for any p G Z n and q G Z 
max(||p'||, |o|) > one has 

||p + aiz||>II + (p', ? )-»/». 

Proof. The only part that requires a comment is the equivalence [5. 6-ii]<£> [5.6-iii], 
which is a special case of Lemma 5.1 with 

£ = {(||p + ag||,p',g) |peZ",gGZ}. □ 

Corollary 5.7. Let £ 6e parametrized by (4.8), and iei 

fc = #{1 < i < n - 1 | a< + 0} . 
T7ien £ is strongly extremal iff a ^ VV^" +1 (n, 1). 

Proof. By the previous corollary, the fact that £ is not strongly extremal is equiv- 
alent to saying that for some v > n there exist (p, q) G Z n with arbitrarily large 
max(||p'||, \q\) such that 

(5.13) ||p + ao|| <n + (p',g)-^. 

Equivalently, there exists a sequence of solutions of (5.13) with pi arbitrarily close 
to a,iq for alH = 1, . . . , n — 1, which in particular happens if and only if pi is equal 
to zero for any i with a; = 0. Hence the ratio of n + (p / , g) and |g| is bounded 
from both sides, and, slightly changing v, one gets infinitely many solutions of 
||p + ag|| < |g|-^ fc+1 ). □ 

In particular, one can see that the set of vectors a G W 1 for which the map 
(4.8) is not strongly extremal is slightly bigger than the one corresponding to non- 
extremality, agrees with the latter outside of all the coordinate planes, and still has 
Hausdorff dimension 1. 

6. Further generalizations and open questions 

6.1. As was mentioned before, it would be very interesting to find out whether 
Theorem 1.3 can be extended to the cases when the rank of A is greater than 
one. If it cannot, it would be nice to find a reasonable description of the set of non- 
extremal subspaces, e.g. such that would allow to compute its Hausdorff dimension. 
Similar questions are open in the case of multiplicative approximation; in particular, 
it is not clear, except for the case of hyperplanes, how much smaller than the set 
of extremal subspaces is the set of strongly extremal ones. 

6.2. Given that one of the main results of this paper is that the extremality of an 
affine subspace is inherited by its nondegenerate submanifolds, one can ask whether 
any 'passage of information' takes place when the subspace is not extremal. Let 
us introduce the following definition: for A G M m n (IR), define the Diophantine 
exponent u(A) of A by 

oj(A) d = sup{i; | A G W„(m, n)} . 
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Clearly n/m < u(A) < oo for all A, and A is VWA iff u(A) > n/m. Now, for a 
map f : U — > lR n define u>(f) to be the essential infimum of cu(f(-)), i.e. 

w(f) d =sup|z; | |{xeC/| f(x) £ W„(1,ti)}| > 0} . 

Naturally, if Af is a smooth manifold, we let the Diophantine exponent a; (At) of 
At to be the Diophantine exponent of its parametrizing map. 

Clearly a manifold (map) is extremal iff its Diophantine exponent is the smallest 
possible, i.e. is equal to n = cu(lR n ). Now the following two questions arise: 

• is it true that the Diophantine exponent of an affine subspace C of M. n is 
inherited by manifolds nondegenerate in £? 

• how to efficiently describe the class of affine subspaces with a given Dio- 
phantine exponent? 

The answer to the first question is 'yes', which can be proved by a refinement of 
the 'dynamical' approach developed in [KM1] and the present paper. Furthermore, 
similarly to Theorems 3.8 and 4.2-4.3, for any v > n one can write down necessary 
and sufficient conditions for a good (resp. affine) map f to have a>(f) < v. 

The answer to the second question is as obscure as the problem of extending 
Theorem 1.3 beyond the cases (1.5) and (1.6). Indeed, the conclusion of the latter 
theorem amounts to saying that, for £ parametrized as in (1.4), u(A) < n implies 
lo(jC) = n. In general, one can easily prove that u(C) > u(A); thus one is left 
to ask whether uj(C) is always equal to max (n,u(A)) (th is can be verified in the 
'rank-one' cases (1.5) and (1.6)). All this is going to be the topic of a forthcoming 
paper [K2]. 

6.3. Even more generally, one can replace the right hand side of (1.1) by an arbi- 
trary function of ||q||. Let us specialize to the case of row vectors and introduce the 
following definition: for a non-increasing function ip : N — > (0, 00), define W^(l,n) 
to be the set of y 6 1" for which there are infinitely many q £ Z n such that 

||yq + p\\ < V'GIqII) for some p £ Z . 

It is a theorem of A. V. Groshev ([G], see also [SI]) that almost no (resp. almost 
all) y £ R n belong to W^(l, n) if the series 

00 

(6.1) $> n -V(^) 

k=i 

converges (resp. diverges). (The case of convergence easily follows from the Borel- 
Cantelli Lemma.) It has been recently proved, see [BKM] for the convergence part 
and [BBKM] for the divergence part, that the same dichotomy takes place for any 
nondegenerate submanifold of M n ; in other words, for a smooth map f : U — > M n 
which is nondegenerate in M. n , one has 

f (x) £ Wt/,(1, n) for almost no (resp. almost all) x £ U 

^ ^ if the series (6.1) converges (resp. diverges). 
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It is natural to expect a similar dichotomy for any smooth sub manifold M. of R n , 
with the convergence/divergence of (6.1) replaced by another 'dividing line' condi- 
tion, possibly involving the Diophantine exponent of Ai. The following problems 
remain open: 

• is it true that the aforementioned 'dividing line' condition of an affine sub- 
space C of R n is inherited by manifolds nondegenerate in £? 

• given an affine subspace, find its 'dividing line' condition; or, vice versa, 
describe the class of subspaces with a given 'dividing line'. 

The only result along these lines known to the author is the paper [BBDD], where 
it is shown that the convergence/divergence of (6.1) serves as the 'dividing line' con- 
dition for one-dimensional subspaces of R n of the form (1.6) with b ^ W~(l, n — 1). 

Finally, let us note that the paper [BKM] also contains a more general (in particu- 
lar, multiplicative) version of the convergence case of (6.2), and it is of considerable 
interest to see if the argument from that paper can be applied to the set-up of 
proper affine subspaces and their nondegenerate submanifolds. 
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